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Abstract 

Noise is an unavoidable part of most measurements which can hinder a correct in- 
terpretation of the data. Uncertainties propagate in the data analysis and can lead to 
biased results even in basic descriptive statistics such as the central moments and cumu- 
lants. Expressions of noise-unbiased estimates of central moments and cumulants up to 
the fourth order are presented under the assumption of independent Gaussian uncertain- 
ties, for weighted and unweighted statistics. These results are expected to be relevant 
for applications of the skewness and kurtosis estimators such as outlier detections, nor- 
mality tests and in automated classification procedures. The comparison of estimators 
corrected and not corrected for noise biases is illustrated with simulations as a function 
of signal-to-noise ratio, employing different sample sizes and weighting schemes. 

1 Introduction 

Measurements generally provide an approximate description of real phenomena, because data acquisition 
compounds many processes which contribute, to a different degree, to instrumental errors (e.g., related 
to sensitivity or systematic biases) and uncertainties of statistical nature. While instrumental effects 
are addressed before data analysis, statistical uncertainties propagate in subsequent processing and can 
affect both precision and accuracy of results, especially at low signal-to-noise (S/N) ratios. Correcting 
for biases generated by noise can help the characterization and interpretation of weak signals, and in 
some cases improve a significant fraction of all data (e.g., the number of astronomical sources increases 
dramatically near the faint detection threshold, since there are many more sources far away than nearby) . 
In this paper, noise-unbiased estimates of central moments and cumulants up to the fourth order, 
which are often employed to characterize the shape of the distribution of data, are derived analytically. 
Some of the advantages of these estimators include the ease of computation and the ability to encapsulate 
important features in a few numbers. Skewness and kurtosis measure the degree of asymmetry and 
peakedness or weight of the tails of the distribution, respectively, and they are useful for the detection of 



outliers, the assessment of departures from normality of the data (D'Agostino 1986), the classification 
of light variations of astronomical sources (Rimoldini 2013a I and many other applications. Various 



estimators of skewness and kurtosis are available in the literature (e.g., Moors et al. . 1996 Hosking 1990 



Groeneveld & Meeden 1984 Bowley 1920), some of which aim at mitigating the sensitivity to outliers 



of the conventional formulations. On the other hand, robust measures might miss important features of 
signals, especially when these are characterized by outliers (as in astronomical time series where stellar 
bursts or eclipses from binary systems represent rare events in the light curve) and weighting might help 
distinguish true outliers from spurious data (employing additional information such as the accuracy of 
each measurement), so the traditional forms of weighted central moments and cumulants are employed 
in this work. 
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Moments are usually computed on random variables. Herein, their application is extended to data 
generated from deterministic functions and randomized by the uneven sampling of a finite number of 
measurements and by their uncertainties, whereas the corresponding 'population' statistics are defined in 
the limit of an infinite regular sampling with no random or systematic errors. This scenario is common in 
astronomical time series, where measurements are typically non-regular due to observational constraints, 
they are unavoidably affected by noise, and sometimes also not very numerous: all of these aspects 
introduce some level of randomness in the characterization of the underlying signal of a star. 

While the effects of sampling and sample size on time series are studied in Rimoldini (2013a|b), this 



work addresses the bias, precision and accuracy of estimators when measurements are affected (mostly) 
by Gaussian uncertainties. Bias is defined as the difference between expectation and population values 
and thus expresses a systematic deviation from the true value. Precision is described by the dispersion 
of measurements, while accuracy is related to the distance of an estimator from the true value and thus 
combines the bias and precision concepts (e.g., accuracy can be measured by the mean square error, 
defined by the sum of bias and uncertainty in quadrature) . 

Noise-unbiased expressions are provided for the variance, skewness and kurtosis (central moments and 
cumulants), weighted and unweighted, assuming Gaussian uncertainties and independent measurements. 
The dependence of noise-unbiased estimators on S/N is illustrated with simulations employing different 
sample sizes and two weighting schemes: the common inverse-squared uncertainties and interpolation- 



based weights as described in ( | Rimoldini 2013a). The latter demonstrated a significant improvement in 
the precision of weighted estimators at the high S/N end. 

This paper is organized as follows. The notation employed throughout is defined in Sec. [2] followed 
by the description of the method to estimate Gaussian-noise unbiased moments in Sec.[3j Noise-unbiased 
estimates of moments and cumulants (biased and unbiased by sample-size) are presented in Sections ffl 
and[5j in both weighted and unweighted formulations, and the special case of error-weighted estimators 
is presented in Sec. pi The noise-unbiased estimators are compared with the uncorrected (noise-biased) 
counterparts with simulated signals as a function of S/N ratio in Sec. [71 including weighted and un- 
weighted schemes and two different sample sizes. Conclusions are drawn in Sec. [8j followed by detailed 
derivations of the noise-unbiased estimators in App. [M 

2 Notation 

For a set of n measurements x = (xi, X%, ..., x n ), the following quantities are defined. 

(i) Population central moments fi r = ((x — /i) r ) with mean /u, = (x), where (.) denotes expectation, 



and cumulants K2 = £*2, K 3 = ^3, K 4 = A*4 — 3yit| (e.g., Stuart & Ord 1969). 

(ii) The sum of the p-th power of weights is defined as V p — ^, , W?. 

(iii) The mean 9 of a generic set of n elements 6i associated with weights W{ is 6 = X)"=i W fii/V\- 
(iv) Sample central moments m r — X>j=i w i( x i ~ x) r /V\ and corresponding cumulants k r . 

(v) Sample-size unbiased estimates of central moments Mi and cumulants Ki, i.e., (Mi) = /ii and 

(Ki) = K{. 

(vi) The standardized skewness and kurtosis are defined as g\ = k^/k 2 , 172 = k^/k^, G\ = K^,/K 2 , 
and G2 = K4/K2, with population values 71 = n^/n 2 and 72 = k^/k^. G\ and G2 satisfy 
consistency (for n — > 00) but are not unbiased in general (e.g., see|Heijmans| 1999, for exceptions). 



(vii) Noise-unbiased estimates of central moments and cumulants are denoted by an asterisk superscript. 

(viii) No systematics or other instrumental errors are considered herein and uncertainties are often referred 
to as errors. 

(ix) Statistics weighted by the inverse-squared uncertainties are called 'error-weighted' for brevity and 



interpolation-based weights computed in phase (Rimoldini 2013a) are named 'phase weights 
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3 Method 

The goal is to derive an estimator T*(x, e) as a function of observables (measurements x with correspond- 
ing uncertainties e) which is unbiased by the noise in the data, i.e., such that the expectation (T*(x, e)) 
equals the estimator T(£) in terms of the true (unknown) values £ aimed at by the measurements. 

The noise-unbiased estimator T*(x, e) is obtained with the following procedure and assumptions. If 
n independent measurements x are associated with independent Gaussian uncertainties e, the expected 
value (T(x)) of the estimator T(x) is evaluated from measurements x and the joint probability density 
p(x|£, e), for given true values £ and measurement uncertainties e: 



(T(x))= I T(x')p(x'|£,e)dV 



where 



P(x'|£,e) =II /o- 6Xp 



i=l 



27rei 



2e? 



21 



(1) 



(2) 



As shown in App. \M the expectation (T(x)) of the estimators considered herein can be decomposed as 

<T(x)>=T(0 +/(€,*)• (3) 

Thus, the noise-free estimator T(£) — (T(x)) — /(£, e) can be estimated in terms of measurements x and 
uncertainties e by the noise-unbiased estimator T*(x, e) = T(x) — /*(x, e), where (/*(x, e)) = /(£, e) and, 
by definition, (T*(x, e)) = T(£). The /*(x, e) term is derived first by computing /(£, e) = (T(x)) - T(£) 
and then by replacing terms depending on £ in /(£, e) with terms as a function of x which satisfy the 
requirement (/*(x, e)) = /(£, e) (see App. [A]) . A property often used in the following sections is that 
a noise-unbiased linear combination of N estimators is equivalent to the linear combination of noise- 
unbiased estimators: 



N 

E ciT *( x ) 


* N 

= E 



>^*(x), (4) 

where the coefficients c* are independent of the measurements x. 

4 Gaussian-noise unbiased sample moments and cumulants 

Weighted sample central moments unbiased by Gaussian uncertainties, such as the variance m^, skewness 
7713, kurtosis m\ and the respective cumulants about the weighted mean x* = x are derived assuming 
independent measurements x,, uncertainties Ci and weights Wi, as described in full detail in App.lX] They 
are defined as follows: 
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M _ ,n* -3(m 2 )*. 
By definition, the above expressions satisfy 



(5) 
(6) 

(7) 

(8) 
(9) 
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The unweighted forms can be obtained by substituting w; — 1 (for all i) and V p — n (for all p) in all 
terms, leading to: 

m 2 =m 2 - —^ 2^e l= k 2 

4 = 1 

3(n-2) -A 2 _ 

m 3 = m 3 . >> e * ^ - ar) = fc- 

i=l 



m 4 = 7714 



6(n-2) 

77 2 

4 



Yl e ? (^ _ x y 



i=l 



(mg)* = (m?.) 2 - -^ ^ e 2 (^ - x) 2 + 



: 3 


(12) 


6raJ-A 2 3(n-2) 2 A 4 3 /-A 2 \ 
n 2 2v c * n 3 2^ e * n i\2^^ ) 


(13) 


2(n-2) " 2 / " 2 \ 2 

i=l \i=l / 


(14) 




(15) 



k* A = m\ — 3 (m 2 )*. 

5 Gaussian-noise and sample-size unbiased moments and cumu- 
lants 

The estimates of weighted central moments which are unbiased by both sample-size and Gaussian uncer- 
tainties, such as the variance M 2 , skewness Mg, kurtosis Ml and the respective cumulants, are defined 
in terms of the noise-unbiased sample estimators as follows: 

K=^^r F rn* 2 = KZ (16) 

v 1 — v 2 

M *z= V?-3vlv 2 + 2V 3 mt = K * (1?) 

r V^Vi 4 - 3V?V 2 + WVs + 3Vg - 3V 4 ) , 

4 (v 2 - v 2 )(v* - w 2 v 2 + 8V1V3 + 3y 2 2 - 6V4) ™ 4 

3V?(2V?v a - 214V3 - 3v 2 2 + 3V4) , 2 y (18) 



K* 



(V, 2 - V 2 ){V? - 6V 2 V 2 + 8V!V 3 + 3V 2 - W A ) 
VfiVf-W^ + W 2 ) 



1 (Vf-V^iVf-GV^ + SViVz+Wi-eVi) 4 

^. (10) 



w 2 (vt-2v 2 v 2 + w 1 v s -w 2 ) , 2,* 



(^i 2 - Va)(Vi - 6V?V2 + 8V1V3 + 3F 2 2 - 6V4) 



The derivation of the sample-size unbiased weighted estimators is described in Rimoldini (2013b). The 



corresponding unweighted forms can be achieved by direct substitution V p — n for all p, leading to: 



n 
M 2 = m 2 — M 2 

n — 1 n 



n 

-E e ? = A i ( 20 ) 



n 2 3 

^3=7 ttt ^m%=M z T ^2eHxi-x)=K* 3 (21) 

(n-l)(n-2) n ~ l l^i 

w* n(n 2 -2n + 3) „ 3n(2n — 3) . 2 ,* ,__. 

M* = 7 -ti T7 — - — r ml - t -^ r-/ r (mi)* (22) 

4 (n-l)(n-2)(n-3) 4 (n - l)(n - 2)(n - 3) v 2y v y 

r^* n 2 (n+l) 3n 2 2 

^4 = 7 1V 0U ^ ml - — (m 2 2 )*. 23 

(n — l)(n— 2J(n — 3) (n — 2)(n — 3) 
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6 Special cases 

If weights are related to measurement errors as Wi = 1/ef , the noise-unbiased weighted sample moments 
and cumulants reduce to the following expressions: 



m 2 



m 3 
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(24) 
(25) 

(26) 

(27) 
(28) 



In the case of constant errors, i.e., e^ = eo for all i, some of the unweighted estimators are equivalent 
or similar to their noise-unbiased counterparts: 



Skewness: k^ — fc 3 and K£ = K3 
Kurtosis: k\ « k^ and K\ = K 4l 



(also nij = 7713 and M| = M3), 



where the approximation k\ sa fc 4 holds for large values of n or SyiV ratios since 

K-ki _Qel (k* 2 + k 2 ) 6 [1 + 2 (5/7V) 2 ] 



"2 



nfc| 



n [1 + {S/N)- 



l2 ' 



(29) 
(30) 



(31) 



considering that, for constant errors, X^ e i/ n = e o an< ^ (^/-^O 2 ~ ^2/ e o ~ ^/e 2 , — 1- For sample 
cumulants up to the fourth order, only the variance depends strongly on noise. However, this is an 
important estimator because it is often involved in definitions of standardized skewness (g\ and G\) and 
kurtosis (g 2 and G 2 ) as follows: 



.91 

.92 



; /; 3 / 2 
«3/«2 > 



K 3 /K? 2 



fc 4 /fcf, G 2 =K 4 /Kl 

For consistency with the above definitions, the noise-unbiased equivalents are defined as 

gl = kl/(k* 2 f/*, G\ = Kl/{K*fl\ 
k*J{klf, Gl = K*J{K* 2 f, 



.92 



(32) 
(33) 

(34) 
(35) 



although the truly noise-unbiased expressions should have been computed on the ratios in Eqs ( 32 )-(|33 



The application of Eqs (34 1— ( 35 ) should generally be restricted to larger samples (e.g., n > 50) with S/N 



ratios greater than a few, in order to avoid non-positive values of k 2 or K 2 . 

7 Estimators as a function of signal-to-noise ratio 

Noise-biased and unbiased estimators are compared as a function of signal-to-noise ratio S/N with sim- 
ulated data and different weighting schemes for specific signals, sampling and error laws. The values of 
the population moments of the continuous simulated periodic 'true' signal £(</>) are computed averaging 
in phase 4> as follows: 



jJLr 
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2^ 



2,T 
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(36) 
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7.1 Simulation 



Simulated signals are described by a sinusoidal function to the fourth power, which has a non-zero 
skewness and thus makes it possible to evaluate the precision and accuracy of the skewness standardized 
by the estimated variance without simply reflecting the accuracy of the variance. The S/N level is 
evaluated by the ratio of the standard deviation J]X2 of the true signal £(<fi) and the root of the mean of 
squared measurement uncertainties q (assumed independent of the signal). The signal £((/>) is sampled 
n = 100 and 1000 times at phases <pi randomly drawn from a uniform distribution, while the S/N ratio 
varies from 1 to 1000 and determines the uncertainties a of measurements x% as follows: 



£(</>)= A sin 4 
e~JV(6,e?) for & = £(&) and & ~ W(0, 2tt) 

; = (1 + Pi ) M 2 / (S/N) 2 for Pl ~ W(-0.8, 0.8) 



(37) 
(38) 
(39) 



where the i-th. measurement X{ is drawn from a normal distribution Af(d, ef) of mean £, and variance ef . 
The latter is defined in terms of a variable pj randomly drawn from a uniform distribution U(— 0.8, 0.8) 
so that measurement uncertainties vary by up to a factor of 3 for a given /12 and S/N ratio. Simulations 
were repeated 10 4 times for each S/N ratio (for n = 100 and 1000). 

The dependence of weighted estimators on sample size and the corresponding unbiased expressions 
were presented in Rimoldini ( 2013b[ ). Herein, only large sample sizes are employed so that sample-size 
biases are negligible with respect to the ones resulting from small S/N ratios. A sample signal and 
simulated data are illustrated in Fig. [I] for n = 100 and S/N — 2. The reference population values of the 
mean, variance, skewness and kurtosis of the simulated signal are listed in table 1 of 

Error weights are defined by w 
phase-sorted data: 



iA?, 



while phase weights follow Rimoldini 



Rimoldini (2013b) 



(2013a), assuming 



Wi = h(S/N\a, b) 



En 1 

w [ = 4>i+i - 4>i-i 

w[ — 02 - 4>n + 2?T 

w' n = 4>i - 4> n -i + 2n 
h(S/N\a,b) = 



[l-h(S/N\a,b)] 



En 
.7=1 ' 



Vie (l,n) 



1 + e -(S/N-a)/b 



Vie (2,n-l) 



for a, b > 0. 



(40) 

(41) 
(42) 
(43) 

(44) 



Weighting effectively decreases the sample size, since more importance is given to some data at the 
expense of other ones and results depend mostly on fewer 'relevant' measurements (e.g., weighting by the 
inverse-squared uncertainties can worsen precision at high S/N levels) . Weighted procedures are desirable 
when the dispersion and bias of estimators from an effectively reduced sample size are smaller than the 
improvements in precision and accuracy (e.g., weighting by inverse-squared uncertainties can improve 
both precision and accuracy at low S/N ratios). Also, weighting might exploit correlations in the data to 



improve precision, as it is shown employing phase weights (Rimoldini 2013a). Since correlated data do 



not satisfy the assumptions of the expressions derived herein, their application might return biased results. 
However, small biases could be justified if improvements in precision are significant and, depending on 
the extent of the application, larger biases could be mitigated with mixed weighting schemes, such as the 



one described by Eqs (40|-(44) 



Estimators derived herein assume a single weighting scheme and combinations of estimators (like the 
variance and the mean in the standardized skewness and kurtosis) are expected to apply the same weights 
to terms associated with the same measurements. The function h(S/N\a,b) constitutes just an example 
to achieve a mixed weighting scheme: tuning parameters a, b offer the possibility to control the transition 
from error- weighted to phase- weighted estimators (in the limits of low and high S/N, respectively) and 
thus reach a compromise solution between precision and accuracy for all values of S/N, according to the 
specific estimators, signals, sampling, errors, sample sizes and their distributions in the data. 
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S/N = 2, n = 100 




0.4 0.6 

Phase (<|) / 2k) 

Figure 1: A simulated signal of the form of sin <f> (blue curve) is irregularly sampled by 100 measurements 
(denoted by triangles) with S/N = 2. 



7.2 Results 



The results of simulations are illustrated for sample estimators, since the conclusions in Rimoldini (2013b) 



suggested that phase-weighted sample estimators can be more accurate and precise than the sample-size 
unbiased counterparts in most cases, especially for large sample sizes as considered herein. 

Figure [2] illustrates the sample mean in the various scenarios considered in the simulations: sample 
sizes of n = 100 and 1000, unweighted and with different weighting schemes (error- weighted, phase- 
weighted and combined error-phase weighted). While accuracy is the same in all cases, the best precision 
of the mean is achieved employing phase weights (including the low S/N end, unlike other estimators) . 

Figures [3 -{16 compare noise-biased {'uncorrected') and noise-unbiased {'corrected') estimators as a 
function of S/N, evaluating the following deviations from the population values: 



m 2 /n2 - 1 



vs iri^l ' H% — 1, 



/ 3/2 * / 3/2 

"V/V -7i vs "VAV -7i) 



1714 /nl 



h/t4 - 72 



3-72 vs ml/nl 



'72, 



vs kt//4-j2, 



.9i - 7i 

ffl4/m 2 
32 - 72 



vs g x -7i, 
3 — 72 vs m* A /{m* 2 Y 
vs 3*2 -72, 



'72, 



(45) 
(46) 
(47) 
(48) 



in both weighted and unweighted cases, for n — 100 and 1000. The dependence on n is described in more 
details in Rimoldini (2013b I. Estimators standardized by both true and estimated variance are presented 



to help interpret the behaviour of the ratios from their components. 

All figures confirm that 'corrected' and 'uncorrected' estimators have similar precision and accuracy at 
high S/N levels (typically for S/N > 10). Noise-unbiased estimators are found to be the most accurate in 
all cases and over the whole S/N range tested. Their precision is generally similar to the noise-uncorrected 
counterparts, apart from estimators standardized by the estimated variance, such as g\, g 2 and 1714/1712, 
for which the uncorrected version can be much more precise (although biased) for S/N < 2, typically. 
As expected, the precision of estimators employing n — 1000 measurements per sample was greater than 
the one obtained with sample sizes of n = 100. 

Weighting by the inverse of squared measurement errors made the estimators slightly less precise at 
high S/N ratios, but more precise and accurate at low S/N levels (except for the mean). 

Weighting by phase intervals led to a significant improvement in precision of all estimators in the limit 
of large S/N ratios and a reduction of precision at low S/N (apart from the case of the mean). Tuning 
parameters such as a = 2 and b = 0.3 in Eq. (40 1 were able to mitigate the imprecision at low S/N 
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reducing to the error-weighted results, which appeared to be the most accurate and precise in the limit 
of low S/N ratios (in these simulations) . This solution might provide a reasonable compromise between 
precision and accuracy of all estimators, at least for S/N > 1. 

Figures [5}j8] show that the skewness moment 7713 is quite unbiased by noise, while the standardized 
version g\ is underestimated at high S/N because of the overestimated variance ni2 (as shown in Figspjjfl]). 
While the accuracy of g\ deteriorates at low S/N, its precision is much less affected by noise. 

The kurtosis moment 7714 (Figsp) 12) is less precise and accurate than the noise-unbiased equivalent, 



and its normalization by the squared variance reduces dramatically its inaccuracy and imprecision (since 
m2 and mi exhibit a similar trend as a function of S/N). The kurtosis cumulant k±, instead, is much 
closer to its noise-unbiased counterpart, as shown in Figs |13Hl6[ The normalization of k± by the squared 
variance improves its precision at the cost of lower accuracy for S/N < 10: the bias of gi is similar to 
(greater than) the precision of g\ for n — 100 (n = 1000). 

The lower the S/N level is, the less precise estimators are and the noise-unbiased variance can be 
underestimated (and even become non-positive). Thus, the skewness and kurtosis estimators standardized 



by k\ or K.\, as in Eqs (34)— (35 1, should be avoided in circumstances that combine small sample sizes 
(up to a few dozens of elements) and low S/N ratios (of the order of a few or less). 

Figures related to moments and cumulants of irregularly sampled sinusoidal signals are very similar 
to the ones presented herein, with the exception of g\, which would have a similar precision but with no 
bias, as a consequence of the null skewness of a sinusoidal signal (since the mean of A3 estimates is zero, 
they are not biased by the standardization with an overestimated noise-biased variance). 

From the comparison of noise-biased and unbiased estimators with different weighting schemes, it ap- 
pears that, for large sample sizes, noise-unbiased phase-weighted estimators are usually the most accurate 
for S/N > 2 (apart from the special cases of standardized skewness and kurtosis when their true value 
is zero). For noisy signals (e.g., S/N < 2), error weighting seems the most appropriate, at least with 
Gaussian uncertainties, thus noise-unbiased error-phase weighted estimators can provide a satisfactory 
compromise in general. Further improvements might be achieved by tuning parameters better fitted to 
estimators and signals of interest, in view of specific requirements of precision and accuracy. 

8 Conclusions 

Exact expressions of noise-unbiased skewness and kurtosis were provided in the unweighted and weighted 
formulations, under the assumption of independent data and Gaussian uncertainties. Such estimators 
can be particularly useful in the processing, interpretation and comparison of data characterized by low 
S/N regimes. 

Simulations of an irregularly sampled skewed periodic signal were employed to compare noise-biased 
and unbiased estimators as a function of S/N in the unweighted, inverse-squared error weighted and 
phase-weighted schemes. While noise-unbiased estimators were found more accurate in general, they 
were less precise than the uncorrected counterparts at low S/N ratios. The application of a mixed 
weighting scheme involving phase intervals and uncertainties was able to balance precision and accuracy 
on a wide range of S/N levels. The effect of noise-unbiased estimators and different weighting schemes 



on the characterization and classification of astronomical time series is described in Rimoldini (2013a). 
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Figure 2: Sample mean for S/N > 1 and n — 100, 1000: unweighted on the top-left hand side, weighted by 
the inverse of squared measurement errors on the top-right hand side, and weighted by phases and errors, 



according to Eq. (40 1, with different parameter values, as specified above the lower panels. Shaded areas 



encompass one standard deviation from the average of the distribution of the mean employing simulations 



defined by Eqs (37)— (39) 
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Figure 3: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample variance for S/N > 1 
and n = 100: unweighted on the top-left hand side, weighted by the inverse of squared measurement 



errors on the top- right hand side, and weighted by phases and errors, according to Eq. (40), with different 



parameter values, as specified above the lower panels. Shaded areas encompass one standard deviation 



from the mean of the distribution of the variance employing simulations defined by Eqs (37 )— ( 39 ) 
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Figure 4: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample variance for S/N > 1 
and n = 1000: unweighted on the top-left hand side, weighted by the inverse of squared measurement 



errors on the top- right hand side, and weighted by phases and errors, according to Eq. (40), with different 



parameter values, as specified above the lower panels. Shaded areas encompass one standard deviation 



from the mean of the distribution of the variance employing simulations defined by Eqs ( 37 1— ( 39 ) 
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Figure 5: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample skewness for S/N > 1 
and n = 100: unweighted in the upper panels and weighted by the inverse of squared measurement errors 
in the lower panels. Shaded areas encompass one standard deviation from the mean of the distribution 



of the skewness employing simulations defined by Eqs ( 37 )— ( 39 1 
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Figure 6: Noise-biased ('uncorrected') versus noise- unbiased ('corrected') sample skewness for S/N > 1 



and n — 100, weighted by phases and errors, according to Eq. (40), with different parameter values 



as specified above each panel. Shaded areas encompass one standard deviation from the mean of the 



distribution of the skewness employing simulations defined by Eqs (37H39). 
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Figure 7: Noise-biased ('uncorrected') versus noise- unbiased ('corrected') sample skewness for S/N > 1 
and n — 1000: unweighted in the upper panels and weighted by the inverse of squared measurement errors 
in the lower panels. Shaded areas encompass one standard deviation from the mean of the distribution 



of the skewness employing simulations defined by Eqs ( 37 )— ( 39 1 
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Figure 8: Noise-biased ('uncorrected') versus noise- unbiased ('corrected') sample skewness for S/N > 1 



and n — 1000, weighted by phases and errors, according to Eq. (401, with different parameter values 



as specified above each panel. Shaded areas encompass one standard deviation from the mean of the 



distribution of the skewness employing simulations defined by Eqs (37)-(39). 
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Figure 9: Noise-biased ( 'uncorrected'') versus noise- unbiased ( 'corrected') sample kurtosis MJ*' for S/N > 
1 and n — 100: unweighted in the upper panels and weighted by the inverse of squared measurement errors 
in the lower panels. Shaded areas encompass one standard deviation from the mean of the distribution 



of the kurtosis employing simulations defined by Eqs (37 1— ( 39 ) 
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Figure 10: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample kurtosis M^* for 
S/N > 1 and n — 100, weighted by phases and errors, according to Eq. (40 1, with different parame- 



ter values, as specified above each panel. Shaded areas encompass one standard deviation from the mean 



of the distribution of the kurtosis employing simulations defined by Eqs ( 37 )-( 39 ) 
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Figure 11: Noise-biased ('uncorrected') versus noise- unbiased ('corrected') sample kurtosis M^ for 
S/N > 1 and n — 1000: unweighted in the upper panels and weighted by the inverse of squared mea- 
surement errors in the lower panels. Shaded areas encompass one standard deviation from the mean of 



the distribution of the kurtosis employing simulations defined by Eqs ( 37 1— ( 39 ) 
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Figure 12: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample kurtosis M^* for 
S/N > 1 and n — 1000, weighted by phases and errors, according to Eq. (40), with different param- 



eter values, as specified above each panel. Shaded areas encompass one standard deviation from the 



mean of the distribution of the kurtosis employing simulations defined by Eqs ( 3T I— ( 39 ) 
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Figure 13: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample kurtosis K± for 
S/N > 1 and n — 100: unweighted in the upper panels and weighted by the inverse of squared measure- 
ment errors in the lower panels. Shaded areas encompass one standard deviation from the mean of the 



distribution of the kurtosis employing simulations defined by Eqs ( 37 )— ( 39 1 
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Figure 14: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample kurtosis i<Q for 
Sy^V > 1 and n — 100, weighted by phases and errors, according to Eq. (40 1, with different parame- 



ter values, as specified above each panel. Shaded areas encompass one standard deviation from the mean 



of the distribution of the kurtosis employing simulations defined by Eqs ( 37 )-( 39 ) 
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Figure 15: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample kurtosis iQ for 
S/N > 1 and n = 1000: unweighted in the upper panels and weighted by the inverse of squared mea- 
surement errors in the lower panels. Shaded areas encompass one standard deviation from the mean of 



the distribution of the kurtosis employing simulations defined by Eqs ( 37 1— ( 39 ) 
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Figure 16: Noise-biased ('uncorrected') versus noise-unbiased ('corrected') sample kurtosis i<Q for 
S/N > 1 and n — 1000, weighted by phases and errors, according to Eq. (40), with different param- 



eter values, as specified above each panel. Shaded areas encompass one standard deviation from the 



mean of the distribution of the kurtosis employing simulations defined by Eqs (37 1— (39) 
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A Derivation of noise- unbiased moments 

The derivations presented in this Appendix involve weighted estimators under the assumption of inde- 
pendent measurements, uncertainties and weights. Definitions and some of the relations often employed 
herein are listed below. 

• For brevity, m r — m r (x), and ^2 t and ]X are implied to involve all (from the 1-st to the n-th) 
terms, unless explicitly stated otherwise. 

• The following integral solutions are often employed: 
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(49) 



• The expected value (m) of a generic estimator m(x) = J2i a i x i E«i ^i x ] J2k^i,j °k x k E«ij,fc ^l x i 
of independent data with Gaussian uncertainties is computed as follows 
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• The results of the following expressions are employed: 
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E* w l e i E^ 4 w iO Efe^ij w fc6 
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A.l Outline of results 

The expressions of the elements pursued along the derivation of noise-unbiased estimators (detailed in 
Sec. A. 2) are summarized below, following the notation introduced in Sections [2] and [3J 
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A. 2 Detailed computations 
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Since the right-hand side of Eq. (112) does not depend on £, /*(x, e) = /(£, e) = (to2(x)) — w&2(£) 



and the expression of the noise-unbiased sample variance m* 2 = 77i2(x) — /*(x, e) is found immediately: 
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In order to remove the dependence on £ in Eqs ( 113 )-( 115 ), /*(x, e) is derived from /(£, e) such that 
(/*(x, e)) = /(£, e). In the case of skewness, /(£, e) has the following form: 
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1201 equals Eq. (117), it follows 
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and the noise- unbiased sample skewness m^ — m 3 (x) — /*(x, e) is 
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For the kurtosis moment and cumulant, /(£, e) involves ^-dependent terms of the form J^ c; (^ — M . 
The computation of ( ^ c%{xi — x) 2 ) leads to: 
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Thus, each of the terms of the form ^ c^ (^ — £J in Eqs ( 114 1— ( 115 ) can be replaced by the expression 
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and the noise-unbiased sample kurtosis moment m\ and cumulant k\ are found as follows: 
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